May 8, 2025
Past
Present
Future
Photo taken: 1932 in Norton Canon, Herefordshire
Photo: 2025 from Google Street View
Photo taken: April 2025 at the GISRUK conference in Bristol
With a year in Salamanca
Photo taken: June 2007 from my flat at the time in Salamanca
Influential book “SEWTHA”, freely available at withouthotair.com (MacKay 2009)
Blog post in The Oil Drum
Source: https://etheses.whiterose.ac.uk/id/eprint/5027/
Photo taken: February 2021, Sugarwell Hill, Leeds
First proper job (🙏Mark Birkin) and first Leeds-based paper (Lovelace et al. 2014)
Source: CyclingUK (formerly CTC) response to government’s Cycling Delivery Plan consultation, available online at cyclinguk.org.
Work on the economic benefits of cycling nationwide with James Woodcock and Fiona Crawford (Crawford and Lovelace 2015)
Source: article in practitioner magazine (Lovelace 2016).
First Propensity to Cycle Tool paper published in an academic journal (Lovelace et al. 2017)
Source: leeds.ac.uk front page, 2017-03-17
Source: results2021.ref.ac.uk (Lovelace et al. 2023)
Fellowship in collaboration with 10 Downing Street, ONS, Data Science Campus, ADRUK, ESRC from November 2021 until April 2023
Source: Botta et al. (2024), “Packaging Code and Data for Reproducible Research: A Case Study of Journey Time Statistics.” Environment and Planning B.
Two-year contract in the Civil Service from January 2023
My roles:
Source: photo taken May 2023 at the Department for Transport’s Data Science for Transport conference
Active Travel England - Alan Turing Institute grant
Photo credit: Danny Williams
Reproducibility is a continuous variable (Peng 2011)
Source: Raff (2023)
Time
Know-how
Lack of permission
Software is not open
Data is not open access
Someone might use it in unethical ways
Someone might “steal” the work
Lovelace, Tennekes, and Carlino (2022)
Illustration of the ClockBoard zoning system used to visualize a geographically dependent phenomenon: air quality, measured as the mass of PM10 particles in micrograms per cubic meter, from the London Atmospheric Emissions Inventory (LAEI). The facets show the data on the spatial grid available from the LAEI (A), aggregated to London boroughs (B), aggregated to ClockBoard zones covering all the input data (C), and aggregated to ClockBoard zones clipped by the administrative boundary of Greater London (D).
Premise: A key reason for reproducibility is generalisability.
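options(timeout = 600) # allow up to 10 minutes for the download
u1 = "https://movilidad-opendata.mitma.es/estudios_basicos/por-distritos/viajes/ficheros-diarios/2024-03/20240301_Viajes_distritos.csv.gz"
f1 = basename(u1)
# Download the file only if it is not already cached locally
if (!file.exists(f1)) {
  download.file(u1, f1)
}
# Open (or create) an on-disk DuckDB database and connect to it
drv = duckdb::duckdb("daily.duckdb")
con = DBI::dbConnect(drv)
# Register the compressed CSV as a lazy table without loading it into memory
od1 = duckdb::tbl_file(con, f1)
Credit: Egor Kotov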
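A lazy DuckDB table like the one above can be summarised with dplyr verbs before anything is read into memory. The sketch below is illustrative only: it writes a tiny made-up CSV to a temporary file instead of downloading the real data, and the column names `origen`, `destino` and `viajes` are assumptions about the file layout, not something confirmed here. It needs the duckdb, DBI, dplyr and dbplyr packages.

```r
library(dplyr)

# Hypothetical toy data standing in for the daily origin-destination file;
# column names are assumed for illustration
toy = data.frame(
  origen  = c("A", "A", "B"),
  destino = c("B", "B", "A"),
  viajes  = c(1.5, 2.5, 3)
)
tmp = tempfile(fileext = ".csv")
write.csv(toy, tmp, row.names = FALSE)

# In-memory DuckDB connection; the CSV is registered lazily, not loaded
con = DBI::dbConnect(duckdb::duckdb())
od = duckdb::tbl_file(con, tmp)

# The aggregation is translated to SQL and run inside DuckDB;
# collect() brings only the small summary back into R
od_agg = od |>
  group_by(origen, destino) |>
  summarise(viajes = sum(viajes, na.rm = TRUE), .groups = "drop") |>
  arrange(desc(viajes)) |>
  collect()

DBI::dbDisconnect(con, shutdown = TRUE)
print(od_agg)
```

With the real file, the same pipeline could be applied to `od1` once its actual column names have been checked, for example with `colnames(od1)`.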
The package has been onboarded to the rOpenSpain public-benefit data science community (see ropenspain.github.io)
“In essence ‘open access’ goes beyond ‘open source’ in that users are not only given the option of viewing (potentially indecipherable) source code, but are encouraged to do so, with measures taken in the software itself, and the community that builds it, to make it more user-friendly.”
Source: (Lovelace, Parkin, and Cohen 2020)